Exploration of Noncoding Sequences in Metagenomes
Authors
Abstract
Environment-dependent genomic features have been defined for different metagenomes, whose genes and associated processes are related to specific environments. Identifying ORFs and their functional categories is the most common way to associate functional and environmental features. However, an analysis based on finding ORFs misses noncoding sequences, so some regulatory or structural information in the metagenome may be discarded. In this work we analyzed 23 whole metagenomes, including coding and noncoding sequences, using the following sequence patterns: (G+C) content, Codon Usage (Cd), Trinucleotide Usage (Tn), and functional assignments for ORF prediction. We present evidence that a high proportion of noncoding sequences is discarded by common similarity-based methods in metagenomics, and we describe the kind of relevant information those sequences contain. We found a high density of trinucleotide repeat sequences (TRS) in noncoding sequences, with a regulatory and adaptive function for metagenome communities. We present associations between trinucleotide values and gene function, where metagenome clustering correlates with microorganism adaptations and kinds of metagenomes. We propose that noncoding sequences carry relevant information for describing metagenomes and could be considered in whole-metagenome analysis to improve their organization, classification protocols, and their relation with the environment.
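As an illustration of two of the sequence patterns named in the abstract, the sketch below computes (G+C) content and trinucleotide usage for a single DNA string. It is not the authors' pipeline: the function names `gc_content` and `trinucleotide_usage`, the `step` parameter, and the choice to skip ambiguous bases are all assumptions made for this example.

```python
from collections import Counter
from typing import Dict


def gc_content(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    counts = Counter(seq.upper())
    acgt = sum(counts[base] for base in "ACGT")
    if acgt == 0:
        return 0.0  # no unambiguous bases to measure
    return (counts["G"] + counts["C"]) / acgt


def trinucleotide_usage(seq: str, step: int = 1) -> Dict[str, float]:
    """Relative frequency of each trinucleotide in seq.

    step=1 counts every overlapping window (frame-independent, so it
    also applies to noncoding sequence); step=3 counts in-frame
    triplets starting at position 0, which resembles codon usage.
    Windows containing anything other than A, C, G, T are skipped
    (an assumption of this sketch).
    """
    seq = seq.upper()
    counts: Counter = Counter()
    for i in range(0, len(seq) - 2, step):
        tri = seq[i:i + 3]
        if set(tri) <= set("ACGT"):
            counts[tri] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tri: n / total for tri, n in counts.items()}
```

Applied to every read or contig of a metagenome, per-sample vectors of this kind could then be compared or clustered; how the paper normalizes and clusters these values is not stated in the abstract.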
Similar resources
Study of Long Noncoding RNA FER1L4 and RB1, as Its Competing Endogenous RNA Network Target Gene, in Breast Cancer
Introduction: Breast cancer is the second most common cause of cancer-related death among females, which requires an exploration for markers to propose a more specific categorization of this cancer. Long noncoding RNAs (lncRNAs), the main subset of noncoding transcripts, are involved in tumorigenic processes. In this study, we investigated the expression of the fer-1–like family member 4 (FER...
A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes
Metagenomics, or sequencing of the genetic material from a complete microbial community, is a promising tool to discover novel microbes and viruses. Viral metagenomes typically contain many unknown sequences. Here we describe the discovery of a previously unidentified bacteriophage present in the majority of published human faecal metagenomes, which we refer to as crAssphage. Its ~97 kbp genome...
Evolutionary analysis of enzymes using Chisel
MOTIVATION Availability of large volumes of genomic and enzymatic data for taxonomically and phenotypically diverse organisms allows for exploration of the adaptive mechanisms that led to diversification of enzymatic functions. We present Chisel, a computational framework and a pipeline for an automated, high-resolution analysis of evolutionary variations of enzymes. Chisel allows automatic as ...
Quantitative Evaluation of the Lateral Sealing Ability of Normal Faults in Siliciclastic Sequences: Implication for Fault Trap in Well Gang 64, in the West Qikou Sag, China
The lateral sealing ability of a normal fault is a major factor in creating hydrocarbon traps. Therefore, a methodology for assessing the sealing ability of faults in the siliciclastic sequences of subsidence basins has been established; moreover, by using this methodology, the uncertainty inherent in hydrocarbon exploration can be decreased. Moreover, the petrophysical properties of fault roc...
Development of an Efficient Hybrid Method for Motif Discovery in DNA Sequences
This work presents a hybrid method for motif discovery in DNA sequences. The proposed method called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses the stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...